{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "3b5c1f5d-1c27-4e2c-806a-d14054f0e3af",
   "metadata": {},
   "source": [
    "## ollama部署"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "6359f2cf-b650-4936-ae95-e153dcafc51e",
   "metadata": {},
   "source": [
    "|        |          | LLM                        | Embedding                        | Reranker                        |\n",
    "| ------ | -------- | -------------------------- | -------------------------------- | ------------------------------- |\n",
    "| Ollama | base_url | http://localhost:11434/v1/ | http://localhost:11434           | http://localhost:11434          |\n",
    "|        | api_key  | NA                         | NA                               | NA                              |\n",
    "|        | model    | qwen3:8B                   | dengcao/Qwen3-Embedding-0.6B:F16 | dengcao/Qwen3-Reranker-0.6B:F16 |\n",
    "\n"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "ea652e06-d652-4655-8376-3ba2b0e8e7f9",
   "metadata": {},
   "source": [
    "## vllm部署"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "74fef4dd-a103-42bf-8dd2-a408844d1b49",
   "metadata": {},
   "source": [
    "\n",
    "|      |          | LLM                      | Embedding                | Reranker                 |\n",
    "| ---- | -------- | ------------------------ | ------------------------ | ------------------------ |\n",
    "| vLLM | base_url | http://localhost:9992/v1 | http://localhost:8000/v1 | http://localhost:8001/v1 |\n",
    "|      | api_key  | token-abc123             | NA                       | NA                       |\n",
    "|      | model    | my_qwen3_14b             | Qwen3-Embedding-0.6B     | Qwen3-Reranker-0.6B      |"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "26d63752-6aca-496f-a5be-4e5cc97ae570",
   "metadata": {},
   "source": [
    "## Xinference部署"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "c571f7f5-87bf-487e-b607-105f1dd896f3",
   "metadata": {},
   "source": [
    "\n",
    "|            |          | LLM                      | Embedding             | Reranker              |\n",
    "| ---------- | -------- | ------------------------ | --------------------- | --------------------- |\n",
    "| xinference | base_url | http://localhost:9997/v1 | http://localhost:9997 | http://localhost:9997 |\n",
    "|            | api_key  | NA                       | NA                    | NA                    |\n",
    "|            | model    | my_qwen3_14b             | my_qwen_embed_0.6b    | my_qwen_reranker_0.6b |"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "5fd33858-62b7-4e62-bd88-5cac6c7f1974",
   "metadata": {},
   "source": [
    "## 硅基流动(免部署)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "dea18bf3-14a2-4619-a9fe-ac50a2ee925f",
   "metadata": {},
   "source": [
    "|             |          | LLM                                                         | Embedding                                                   | Reranker                             |\n",
    "| ----------- | -------- | ----------------------------------------------------------- | ----------------------------------------------------------- | ------------------------------------ |\n",
    "| siliconflow | base_url | https://api.siliconflow.cn/v1/chat/completions              | https://api.siliconflow.cn/v1/embeddings                    | https://api.siliconflow.cn/v1/rerank |\n",
    "|             | api_key  | Bearer sk-oyynmtyjrsguxrwqdrgyeepzackpwgdrndnzdlydxtjbswup- | Bearer sk-oyynmtyjrsguxrwqdrgyeepzackpwgdrndnzdlydxtjbswup- | Bearer sk-oyynmtyjrsguxrwqdrgyeepzackpwgdrndnzdlydxtjbswup-                                    |\n",
    "|             | model    | Qwen/Qwen3-8B                                               | BAAI/bge-m3                                                 | BAAI/bge-reranker-v2-m3              |"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "a9621566-7cea-4254-b52e-830890d9bcf9",
   "metadata": {},
   "outputs": [],
   "source": []
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "95b20d0b-683f-4f16-9139-d55e5651395c",
   "metadata": {},
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "",
   "name": ""
  },
  "language_info": {
   "name": ""
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}
